Towards a Calibrated Corpus for Compression Testing
Abstract
A mini-corpus of twelve 'calibrated' binary-data files has been produced for the systematic evaluation of compression algorithms (available at http://www.tcode.auckland.ac.nz/-mark/corpus). The files are generated within the framework of a deterministic theory of string complexity [2]. Here the T-complexity of a string $x$ (measured in taugs) is defined as $C_T(x) = \sum_i \log_2(k_i + 1)$, where the positive integers $k_i$ are the T-expansion parameters for the corresponding string production process outlined in [1]. $C_T(x)$ is observed in [2] to be the logarithmic integral of the total information content $I_x$ of $x$ (measured in nats), i.e., $C_T(x) = \mathrm{li}(I_x)$. The average entropy is $H_x = I_x/|x|$, i.e., the total information content divided by the length of $x$. Thus $C_T(x) = \mathrm{li}(H_x \times |x|)$. Alternatively, the information rate along a string may be described by an entropy function $H_x(n)$, $0 < n \le |x|$, for the string [3]. Assuming that $H_x(n)$ is continuously integrable along the length of $x$, then $I_x = \int_0^{|x|} H_x(n)\,dn$. Thus $C_T(x) = \mathrm{li}\left(\int_0^{|x|} H_x(n)\,dn\right)$. Solving for $H_x(n)$, that is, differentiating both sides and rearranging, we get:

$$H_x(n) = \frac{\partial C_T(x|n)}{\partial n} \times \log_e\left(\mathrm{li}^{-1}\left(C_T(x|n)\right)\right) \qquad (1)$$
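The relations in the abstract can be checked numerically. The following is a minimal Python sketch, not the author's implementation: the function names (`t_complexity`, `li`, `li_inverse`, `average_entropy`, `entropy_rate`) and the example parameter values are hypothetical, the logarithmic integral is evaluated with the standard series for Ei(ln x), and the inverse is found by plain bisection. It assumes the T-expansion parameters of a string are already known and does not perform the T-decomposition itself. The last function is a finite-difference reading of equation (1), assuming the complexity is sampled at two nearby positions along the string.

```python
import math

EULER_GAMMA = 0.5772156649015329

def t_complexity(k_params):
    """T-complexity in taugs: sum of log2(k_i + 1) over the T-expansion parameters."""
    return sum(math.log2(k + 1) for k in k_params)

def li(x):
    """Logarithmic integral li(x) for x > 1, using the series for Ei(ln x)."""
    t = math.log(x)
    total = EULER_GAMMA + math.log(t)
    term = 1.0
    for k in range(1, 500):
        term *= t / k                      # term is now t**k / k!
        total += term / k
        if term / k < 1e-17 * max(1.0, abs(total)):
            break
    return total

def li_inverse(c):
    """Invert li on (1, inf) by bisection; li is increasing there."""
    lo, hi = 1.0 + 1e-12, 2.0
    while li(hi) < c:                      # grow the bracket until it contains c
        hi *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if li(mid) < c:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def average_entropy(c_t, length):
    """Average entropy in nats per symbol: li^{-1}(C_T) / |x|."""
    return li_inverse(c_t) / length

def entropy_rate(c_before, c_after, dn):
    """Finite-difference estimate of dC_T/dn * ln(li^{-1}(C_T))."""
    slope = (c_after - c_before) / dn
    c_mid = 0.5 * (c_before + c_after)
    return slope * math.log(li_inverse(c_mid))

# Hypothetical T-expansion parameters, chosen only for illustration.
example_ct = t_complexity([1, 1, 3])
```

As a sanity check on the sketch, a string with a constant information rate of 0.5 nats per symbol has $C_T = \mathrm{li}(0.5\,n)$ after $n$ symbols, and `entropy_rate(li(499.5), li(500.5), 2)` should recover a value close to 0.5.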
Publication date: 1999